MRST: A New Technique For Information Summarization

Authors

  • Afnan Ullah Khan
  • Shahzad Khan
  • Waqar Mahmood
Abstract

Summarization is defined as "the process of concisely restating the essential ideas of a text or passage, and synthesizing the ideas into an overarching idea". There is a growing need for methods that generate good summaries, because a large repository of data is available online yet finding the right information remains very difficult. Many summarization techniques have been devised for this task, such as LEAD, MEAD and RANDOM. This paper proposes a new technique for information summarization that combines Rhetorical Structure Theory (RST) with the MEAD summarizer system. Because MEAD is based entirely on mathematical calculation and lacks a knowledge base, RST is used to overcome this weakness. The new summarizer system, MRST, is then evaluated against the original MEAD summarizer system on two kinds of text: financial articles and Medline abstracts. In theory MRST should be better than MEAD, but in practice MEAD came out ahead of MRST, for one reason: there is no true parser that completely implements Rhetorical Structure Theory. The results show that MEAD produces successful summaries 75% of the time for both short and long documents. MRST produces successful summaries 70% of the time for short documents and 65% of the time for long documents; its performance deteriorates as document size increases. The main finding of the work is that if a parser that comprehensively implements Rhetorical Structure Theory could be built, it would be possible to produce a summarizer system that is better than MEAD.
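
For orientation, LEAD and RANDOM are usually understood as simple extractive baselines: LEAD keeps the opening sentences of a document, and RANDOM keeps sentences chosen at random. The sketch below illustrates that reading only. It is not the code evaluated in the paper, and the function names, the naive regex sentence splitter, and the `n` and `seed` parameters are all assumptions introduced here.

```python
import random
import re


def split_sentences(text):
    """Naive splitter (an assumption): break on whitespace that follows '.', '!' or '?'."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def lead_summary(text, n=2):
    """LEAD-style baseline: keep the first n sentences of the document."""
    return split_sentences(text)[:n]


def random_summary(text, n=2, seed=0):
    """RANDOM-style baseline: keep n randomly chosen sentences, in document order."""
    sentences = split_sentences(text)
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    k = min(n, len(sentences))
    chosen = sorted(rng.sample(range(len(sentences)), k))
    return [sentences[i] for i in chosen]
```

Neither baseline uses any knowledge of discourse structure, which is the gap the abstract says RST is meant to address.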

Similar resources

Systematic literature review of fuzzy logic based text summarization

'Information Overload' is not a new term but with the massive development in technology which enables anytime, anywhere, easy and unlimited access; participation & publishing of information has consequently escalated its impact. Assisting users' informational searches with reduced reading surfing time by extracting and evaluating accurate, authentic & relevant information are the primary c...

A survey on Automatic Text Summarization

Text summarization endeavors to produce a summary version of a text, while maintaining the original ideas. The textual content on the web, in particular, is growing at an exponential rate. The ability to sift through such a massive amount of data, in order to extract the useful information, is a major undertaking and requires an automatic mechanism to aid with the extant repository of informa...

Biogeography-Based Optimization Algorithm for Automatic Extractive Text Summarization

Given the increasing number of documents, sites, online sources, and the users’ desire to quickly access information, automatic textual summarization has caught the attention of many researchers in this field. Researchers have presented different methods for text summarization as well as a useful summary of those texts including relevant document sentences. This study select...

Ijaz: An Operational System for Single-Document Summarization of Persian News Texts

The rapid growth of published documents on the web has created new demands for processing, classification and information retrieval, so the use of natural language processing tools has increased around the world. Automatic summarization is known as the core of a wide range of text-processing tools such as decision systems, accountability systems, search engines, etc., and has always been inv...

Graph Hybrid Summarization

One solution for processing and analyzing massive graphs is summarization. Generating a high-quality summary is the main challenge of graph summarization. To generate a better-quality summary for a given attributed graph, both structural and attribute similarities must be considered. There are two measures named density and entropy to evaluate the quality of structural and at...

Publication date: 2005